BENEFITS OF THIS EXCEL DOCUMENT
- Clear explanations of the revenue and cost drivers--complete with usage chains, token economics, and fundraising effects--provide an immediate blueprint for strategic planning and investor storytelling.
- Real‑time linkage between every assumption and the key return metrics (IRR, MoIC, cash runway) lets founders and investors see the impact of pricing, growth, or cost changes in seconds.
- It distills a complex AI‑SaaS business model into a single input sheet and four calculation tabs, so users can explore five‑year financial outcomes without wrestling with hidden formulas.
SAAS EXCEL DESCRIPTION
Editor Summary
An Excel pro forma model (XLSX) for AI agentic platforms that models revenue as a cascade from customer logos to seat subscriptions, usage-based billing, token consumption, and a one-time implementation fee.
Read more
Includes modules for seat-based ARR, usage billing (tasks → reasoning steps → tokens billed per 1,000 tokens), and cost drivers (token COGS, infrastructure, support, royalties, payment fees). Developed by Jason Varner | SmartHelping (10+ years, 750+ clients). Sold as a digital download on Flevy. Used by CFOs, product leads, founders, and investors for pricing and unit-economics analysis.
Use this model when you need to forecast ARR and usage revenue for an AI product that charges both seat subscriptions and token-based usage — for example when designing pricing, projecting investor returns, or deciding whether to self-host inference.
CFOs and finance leads stress-test ARR impact from new logos, seat expansion, churn, and annual price escalators to project EBITDA and cash balance.
Product managers compare light-touch copilots versus heavy autonomous agents by varying tasks/day, reasoning steps, and model size to observe usage revenue changes.
Founders and FP&A teams model runway improvements from one-time implementation fees versus ongoing churn effects on recurring revenue.
VP Engineering evaluates the breakeven of API vs self-hosted inference by simulating wholesale API rates versus in-house inference cost per 1,000 tokens.
The driver-based cascade from logos → seats → tasks → tokens reflects standard unit-economics and ARR forecasting practice used in investor-ready financial models.
The model treats revenue generation as a cascade that begins with customer acquisition and ends with token consumption. It starts with seat‑based subscriptions—the predictable, high‑margin engine that investors look for in ARR‑oriented businesses. Customer logos grow at tier‑specific rates, each logo carries an assumed number of active seats, and every seat is priced monthly. Because these subscriptions renew automatically and churn is modeled explicitly, you can isolate the lift that new sales, seat expansion, and annual price escalators contribute to recurring revenue.
Layered on top of seats is usage‑based billing, which converts the depth of product engagement into dollars. The template follows a simple chain: seats drive tasks per day; each task contains a configurable number of reasoning steps; each step consumes tokens; and total tokens are billed at a price per 1 000 tokens. This structure lets you test everything from light‑touch copilots to heavy autonomous agents by varying task frequency, complexity, or model size. A one‑time implementation fee rounds out the top line, injecting early cash to offset acquisition costs without muddling recurring revenue metrics.
Costs mirror the revenue logic to reveal true unit economics. The largest variable expense is token cost of goods—either a wholesale LLM API rate or an in‑house inference cost per 1 000 tokens—which scales in lockstep with usage. Infrastructure and cloud expenses (compute, storage, bandwidth) also ride on API‑call volume but can be optimized through caching, batching, or reserved‑instance commitments. Customer success and support costs grow with logos or seats depending on your service model; automation percentages help you see how self‑serve tooling flattens that curve. Finally, optional buckets such as model‑licensing royalties and payment‑processor fees capture other usage‑linked outflows.
Because revenue and direct costs share the same volumetric driver—tokens—gross margin is extraordinarily sensitive to the spread between price per token and cost per token. A 20 percent reduction in token COGS therefore has the same bottom‑line impact as a 20 percent increase in usage price, but is usually easier to achieve through vendor negotiations or model optimization. Implementation fees, although non‑recurring, materially improve early runway by front‑loading cash, while support‑cost efficiency dictates how much of that margin you ultimately keep.
Taken together, these drivers create a living model where you can dial up growth, experiment with price points, or project cost‑curve improvements and instantly see how they ripple through EBITDA, cash balance, and investor returns. The clarity around "what moves what" not only guides operational decisions—like when to switch from API calls to self‑hosted inference—but also strengthens your narrative with investors who want to understand precisely where scale economics kick in.
Got a question about the product? Email us at support@flevy.com or ask the author directly by using the "Ask the Author a Question" form. If you cannot view the preview above this document description, go here to view the large preview instead.
TOPIC FAQ
What are the primary revenue drivers in an AI agentic platform pro forma?
Revenue is modeled as a cascade: customer logos grow by tier, each logo carries active seats priced monthly, seats drive tasks per day, tasks comprise configurable reasoning steps that consume tokens, and tokens are billed at a price per 1,000 tokens; a one-time implementation fee is modeled separately as top-line cash.
How should I model usage-based billing for copilots or autonomous agents?
Represent usage as a chain: seats → tasks per day → reasoning steps per task → tokens consumed per step, then multiply total tokens by price per 1,000 tokens. Vary task frequency, complexity, or model size to simulate light-touch versus heavy-agent scenarios and observe usage revenue changes.
What cost buckets must be included to measure token-based unit economics?
Include token COGS (either wholesale LLM API rate or in-house inference cost per 1,000 tokens), infrastructure and cloud (compute, storage, bandwidth) that scale with API calls, customer success/support that scale with logos or seats, and optional model-licensing royalties and payment-processor fees, all tied to token or usage volume.
How sensitive are gross margins to token price and token cost?
Gross margin is highly sensitive to the spread between price per token and cost per token; a 20 percent reduction in token COGS produces the same bottom-line impact as a 20 percent increase in usage price, illustrating the leverage of token cost improvements on margin.
What should I look for in a pro forma model when evaluating pricing strategies for a seat-plus-usage AI product?
Look for a driver-based structure that lets you vary logos growth, seats per logo, tasks/day, steps per task, token price and token COGS, and that surfaces impacts on EBITDA, cash balance, and investor returns; the AI Agentic Platform Pro Forma: Usage, Seat & Setup Revenue models these drivers explicitly.
How can I decide whether to keep using a hosted LLM API or switch to self-hosted inference?
Compare token COGS under wholesale API rates versus projected in-house inference cost per 1,000 tokens and simulate effects on gross margin and cash runway; the model lets you toggle API vs self-hosted assumptions to reveal where scale economics make self-hosting advantageous per cost per 1,000 tokens.
Which metrics should I highlight when presenting unit economics of an AI agentic product to investors?
Emphasize ARR from seat subscriptions, recurring revenue contribution from seat expansion and annual price escalators, usage revenue from tokens, token gross margin (price minus COGS), churn/renewal impacts, and early cash from implementation fees to show runway and investor-return drivers.
How should I model a one-time implementation fee without distorting recurring revenue metrics?
Treat the implementation fee as a separate, non-recurring top-line cash inflow used to offset acquisition costs and improve early runway; keep recurring metrics (ARR, churn, seat pricing, and token-based usage) modeled independently so recurring revenue and margin dynamics remain clear.
Source: Best Practices in SaaS, Agentic AI, Integrated Financial Model Excel: AI Agentic Platform Pro Forma: Usage, Seat & Setup Revenue Excel (XLSX) Spreadsheet, Jason Varner | SmartHelping